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Abstract — An analytical framework for performance analysis 
and optimization of coded V-BLAST is developed. Average power 
and/or rate allocations to minimize the outage probability as 
well as their robustness and dual problems are investigated. 
Compact, closed-form expressions for the optimum allocations 
and corresponding system performance are given. The uniform 
power allocation is shown to be near optimum in the low 
outage regime in combination with the optimum rate allocation. 
The average rate allocation provides the largest performance 
improvement (extra diversity gain), and the average power 
allocation offers a modest SNR gain limited by the number 
of transmit antennas but does not increase the diversity gain. 
The dual problems are shown to have the same solutions as the 
primal ones. All these allocation strategies are shown to be robust. 
The reported results also apply to coded multiuser detection and 
channel equalization systems relying on successive interference 
cancelation. 

Index Terms — Multi-antenna (MIMO) system, spatial mul- 
tiplexing, coded V-BLAST, power/rate allocation, performance 
analysis 



I. Introduction 

TO exploit the impressive spectral efficiencies of wire- 
less communication systems with multiple antennas at 
both the transmitter and receiver (T), the V-BLAST algo- 
rithm was proposed 10. Its simple transmission and detection 
mechanisms as well as its ability to achieve a significant 
portion of the MIMO capacity have made the V-BLAST a 
popular solution for MIMO signal processing. In this paper, 
we consider zero-forcing (ZF) V-BLAST, which relies on 
the successive interference cancelation (SIC) to decode the 
spatially-multiplexed data streams at the receiver. Because of 
the SIC, the algorithm suffers from the error propagation effect 
so that the overall error performance is dominated by that of 
the 1st stream (with low diversity order) [3]|4||5]|6|, which 
may not be satisfactory. While the optimal ordering procedure 
provides some improvement in SNR (the SNR gain of the 
ordering equals to the number of transmit antennas at high 
SNR in i.i.d. Rayeligh fading), the system diversity order is 
not improved and is still limited by that of the 1st stream (i.e. 
the lowest one) |4]|7||8], which may be unsatisfactory. 

Several techniques have been reported to improve the error 
performance of the uncoded V-BLAST by employing a non- 
uniform power allocation among the transmitters. In particular, 
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Il9l iflOl ifTTl explore the transmit (Tx) power allocation that 
minimizes the instantaneous (i.e. for given channel realization) 
error rate of the uncoded V-BLAST, with or without the 
optimal ordering. For such optimization, a new feedback 
session and power reallocation are needed each time the 
channel changes due to small scale or multipath fading. A 
less demanding approach is to find an optimum allocation 
of the average power based on the average error rate, a 
strategy we refer to as "average optimization". Since this 
ignores small-scale fading, only occasional feedback sessions 
and power reallocations are required, when the average SNR 
changes due to large-scale fading or shadowing, and only the 
average SNR needs to be fed back to the transmitter. Average 
power allocation for the uncoded V-BLAST has been explored 
in ll3l 1151 lfl2l . While its performance is slightly inferior to 
the instantaneous power allocation, it does offer a few dB 
improvement in the SNR and achieves the same gain equal to 
the number of transmit antennas at high SNR [12]. The SNR 
gain of any optimum power allocation is upper bounded by the 
number of transmit antennas and therefore it does not improve 
the diversity gain (provided that all the streams stay active, 
which is required to support high rate) lfl"2"l . Another way to 
reduce error rate of the V-BLAST is via the fixed-complexity 
sphere decoder, which, under certain scenarios, demonstrates 
near-maximum likelihood performance with a fixed number of 
operations lfl3l . 

While the studies above deal with the uncoded V-BLAST, 
most practical communication systems use coding; uncoded 
systems are rare. A coded V-BLAST OFDM system has been 
considered in [14] via the system capacity analysis and an im- 
proved detection scheme to reduce the impact of error propa- 
gation has been presented, assuming uniform power allocation. 
An instantaneous optimization of power, rate and antenna map- 
ping for a coded ZF V-BLAST to minimize the total transmit 
power for given data rate under a zero-outage constraint has 
been presented in [15], assuming capacity-achieving codes or 
realistic ones via the SNR gap to capacity. The optimization 
is performed by a numerical algorithm exploiting the problem 
convexity. While this approach allows significant improvement 
in system performance, it also requires instantaneous feedback 
and computations, which increases the system complexity. Due 
to the numerical nature of the solution, only limited insight is 
available. The SNR-asymptotic diversity-multiplexing tradeoff 
(DMT) analysis of the ordered V-BLAST with optimum rate 
allocation (to optimize the DMT) is presented in lfl6l . While 
this analysis provides some insight into the SNR asymptotics 
of the performance, it is not clear what the finite-SNR impli- 
cations are. 



2 



In this paper, we use an analytical approach to optimization 
and performance analysis of coded ZF V-BLAST, which 
provides significant new insights at finite SNR. To reduce 
the demand on system resources, we consider average op- 
timization of power and rate allocation and provide closed- 
form solutions. Following the earlier work in [14|[15| and 
also the general philosophy advocated in lT7llfT8l , we assume 
that temporal capacity-achieving codes are used for each data 
stream of the V-BLAST, so that all per-stream rates up to 
the stream capacity can be supported with zero probability 
of error. This is motivated by the fact that there are powerful 
codes (LDPC, turbo-codes) that operate very close to capacity. 
Realistic codes are accounted for via the SNR gap to capac- 
ity, as in 03) ED. This model of coded V-BLAST allows 
analytically-tractable optimization and performance analysis 
of the algorithm. 

We perform analysis and performance evaluation of the 
following three optimization strategies, which minimize the 
outage probability under the constraints on the total power 
and rate: 

• average power allocation (APA), which is motivated by 
the fact that many practical system use power control, 

• average rate allocation (ARA), which is suitable for 
variable-rate system using identical and fixed power am- 
plifiers to simplify the RF part of the system, 

• joint average power and rate allocation (APRA), which 
is suitable for variable-rate variable-power systems. 

Compact, closed-form expressions for the optimized powers 
and rates are obtained and their error rate performance and 
robustness are investigated. Our approach is to use analytical 
techniques as much as possible and to resort to numerical 
techniques only for validation purposes. 

The following novel insights are obtained: 

> The APA does not change the system diversity order, 
but offers an SNR gain, which is upper bounded by the 
number of transmit antennas; the upper bound is achieved 
in the low outage regime. This is similar to the uncoded 
V-BLAST Ifl2l . 

• The ARA improves the diversity order and thus is much 
more efficient than the APA in the low outage regime. 

• While the APRA offers a performance improvement in 
high-to-moderate outage regime, it has the same perfor- 
mance as the ARA in the low outage one. Surprisingly, 
the uniform power allocation across the active streams is 
optimum in this regime. 

> All these optimization strategies are robust in terms 
of rate and/or power variations (with power allocation 
demonstrating better robustness than rate allocation), 
which makes them good candidates for practical systems. 

• Dual problems of minimizing the total power or maximiz- 
ing the total rate under the outage probability constraint 
have the same solutions as the primal ones. 

Comparing the 3 optimization strategies, we conclude that 
the ARA offers the largest incremental increase in the perfor- 
mance, with the APA offering a modest SNR gain over the 
unoptimized system and the APRA offering an improvement 
over the ARA in the high-to-moderate outage regime. 



Due to similar system architectures and processing strate- 
gies, most of these results also apply to multiuser detection 
and inter-symbols interference equalization systems that use 
successive interference cancellation. 

All analytical results and approximations are validated via 
simulations. 

The paper is organized as follows. Section Ull introduces the 
basic system model, assumptions and optimization strategies. 
Sections [Til] - [V] derive and analyze the optimum power, 
rate and joint power/rate allocations for the coded V-BLAST, 
section [VI] studies their robustness and section I VIII considers 
the dual problems. Section [Villi concludes the paper. 

II. System Model and Outage Probability 

Analytical performance evaluation of the V-BLAST is a 
challenging task even when no coding is used [3||4]|5||6], 
mainly because of the successive interference cancelation. 
Adding realistic codes to that would make the problem an- 
alytically intractable. On the other hand, many state-of-the 
art codes (e.g. turbo-codes, LDPC) operate very close to the 
capacity (within a fraction of a dB), so that assuming capacity- 
achieving codes is practically relevant and it also makes the 
analysis significantly simpler in many cases. This approach 
has been successfully exploited in [ 1411 151 IT7lll[T8l and will be 
applied here to the coded V-BLAST analysis and optimization. 

In particular, we study ZF V-BLAST without optimal order- 
ing but with optimized average power and/or rate allocation 
and capacity-achieving temporal codes for each stream so that 
the maximum possible rate equals to the capacity of that 
stream. Since capacity-achieving codes are used, there are no 
errors if the stream is not in outage and roughly half of the 
bits are in errors during an outage, so that the overall bit error 
rate is BER w \P ut 111 where P out is the outage probability, 
i.e. the probability that the system capacity is less than the 
target rate. There is no error propagation when all streams are 
not in outage, which also simplifies the analysis significantly. 

The following standard baseband discrete-time MIMO sys- 
tem model is employed, 

Em 
h iv /o^ + £ (1) 
i— 1 

where s = [s\, s 2 , ...s m ] T and r = [n, r 2 , ...r„] T are the 
vectors representing the Tx and Rx symbols respectively, "T" 
denotes transposition, H = [hi, ti2, ...h m ] is the nxm matrix 
of the complex channel gains between each Tx and each Rx 
antenna, where denotes i-th column of H, n and to are the 
numbers of Rx and Tx antennas respectively, n > to, £ is the 
vector of circularly-symmetric additive white Gaussian noise 
(AWGN), which is independent and identically distributed 
(i.i.d.) in each receiver, A = diag U/cti, ■ ■ ■ , ^/o; m ), where di 
is the power allocated to the i-th transmitter. For the regular 
V-BLAST, the total power is distributed uniformly among 
the transmitters, a,\ = a.% = ... = a m = 1. The channel 
is assumed to be non-ergodic ("slow block fading") and the 
key performance measure is the outage probability for given 
target bit rate [17]. Details of a mathematical model of the 

'this approximation is due to the fact that the transition from low to high 
error rate regime is very sharp in capacity-approaching codes (see e.g. | 20 1) 
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uncoded V-BLAST, on which our model of the coded V- 
BLAST is based, and its analysis can be found in [3||4||6|[12| 
and are not repeated here. After the interference cancelation 
from already detected symbols and interference nulling from 
yet-to-be detected symbols, the equivalent scalar channel of 
i-th stream is 



(2) 



where h,u. is i-th column of the channel matrix projected 
onto the subspace perpendicular to that spanned by yet-to-be- 
detected symbols (i.e. by [hj+i, hi + 2, ...h TO ]). This channel 
can support all the rates up to its instantaneous capacity 



Ci = ln(l 



ai|h^| 2 7o) [nat/s/Hz], 



(3) 



where 70 is the average SNR, so that the system outage 
probability is 



Pout = l-l[(l-Pr{C l <R l }y i 



(4) 



where Ri is the stream fixed target rate (which may be a 
function of the average SNR only), and the power allocation 
a = [a\ . . . a m ] is the function of the average SNR 70 only. 
Pr{C,; < Ri] is the outage probability of i-th stream, and the 
system outage takes place when at least one of the streams 
is in outage, i.e. is not able to support its target rate. In i.i.d. 
Rayleigh fading channels, different |h^| 2 are distributed as 
X2(n-m+i) (chi-squared with 2(n— m+i) degrees of freedom) 
and are independent of each other [3] 16], so that the outage 
probability of i-th stream is equal to the outage probability of 
(n — m + i)-th order maximum ratio combiner (MRC): 

P t = Pr{a < Ri} = Pr{ln(l + aj :|h l± | 2 7 o) < R,} 



a*7o 



(5) 



(n — m + i)\ 



ai7o 



n — m-\-i 



< 1 



where Fk(x) = 1 — e~ x ^2^q x l /l\ is the outage probability 
of fc-th order MRC. In the low outage regime, which is of 
practical interest, Ri < ln(l + 0^70) and Pi <C 1, so that 
can be approximated by 1 st order terms only 



Pi 



E 



(n — m + i)l \ <Xi7o 



n— m+z 



(6) 



This is the approximation we use throughout the paper 
(Lemma 6 in Appendix shows that an optimization preserves 
the approximation accuracy, which justifies this approach). 
We note that while the approximations similar to those in 
(O© also hold for the uncoded system HE 021, there 
is a significant difference: the low outage regime implies 
high SNR (70 S> 1) for the uncoded system, but in the 
coded system the SNR may also be low in this regime as 
long as Ri < ln(l + 01*70) (for example, it holds for multi- 
user interference-limited systems, where the interference is 
considered to be a part of noise, e.g. CDMA systems 1 1 7|). 
This observation also applies to the results below which are 
obtained via these approximations. 



With realistic rather than capacity-achieving codes, the 
channel in ® supports all the rates up to ln(l + 
on |h;J 2 7o/r), where T is the SNR gap to capacity lfT9llTT5ll 
so that all our results will also apply with the substitution 
7o -> 7o/T, i.e. with the "effective" SNR 70/r. 

With a fixed power and rate allocation, including the un- 
optimized system (Ri — R, on = a), the diversity order of 
i-th stream is n — m + i so that 1st stream has the lowest 
diversity order n — m + 1 and, thus, this stream provides 
the dominant contribution to the system outage probability 
in the low outage regime, with vanishingly small contribution 
coming from higher-order streams, 



Pn 



Pi > P 2 > ... > P„ 



(7) 



and the overall system diversity order is also n — m + 1. 
This parallels the corresponding result for the uncoded system 
00. 

In this paper, we consider an average optimization, i.e. 
an optimum power and/or rate allocation is found based on 
channel statistics and stays the same as long as the average 
SNR stays the same so that the system tracks only large-scale 
channel variations (similarly to the uncoded system [3|[12|), 
which reduces the demand on system resources and feedback 
channel. 

The following optimization strategies are considered below: 
optimum average (per-stream) power allocation (APA), opti- 
mum average (per-stream) rate allocation (ARA), and joint 
average power and rate allocation (APRA), all to minimize 
the outage probability for given total data rate and power. 

III. Average Power Allocation 

In this section, we consider the optimum average power allo- 
cation (APA) among the streams with uniform rate allocation, 

Ri = R\ 



min P ou t subject to \^ oii 

cy — 4 



(8) 



i=0 



The solution of this problem can be characterized in the 
following way. 

Theorem 1. In the low outage regim^ R < ln(l + 70), the 
optimum allocation of average power to minimize P ou t (i-e. 
the problem in (0) can be approximated as 



m 



i=2 



(9) 



1 



7o 



, i = 2 ... to, 



where the numerical coefficients bi are given by 
i n ~™+ 2 ( n - to)! 



(n — m 
Proof: see Appendix lAl 



1) 



(10) 



2 Note that R < ln(l + 70) is the low outage regime for the unoptimized 
system. As the optimization can only decrease outage probability, the approx- 
imation in (5) is always valid for a system in which and/or Ri have been 
optimized in one way or another. 
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In the unoptimized system, 1st stream error rate has the 
smallest diversity order n — m + 1, which increases with 
the stream number (see (O), so that the outage probability is 
dominated by 1st term at the low outage regime, P out ~ Pi, 
and the APA allocates most of the power to this stream to 
reduce P ou t, with progressively smaller amounts to higher- 
order streams, which can be formalized as follows. 

Corollary 1. The optimum power allocation in (O behaves at 
the low outage regime as follows, 



ajsim» a*... > a* m 



(11) 



i.e. most of the power goes to 1-st stream, with vanishingly 
small portions to higher-order streams^ 

The error rates of the optimized system are obtained by 
combining (O and ©: 



Pi 



1 



(n — m + 1)! 

(i-l)(n-m + i) 



1 



TO7o 

e R -l 



n— m+1 



(12) 



7o 



where di is i-th stream diversity order, 

_ (n - m + i)(n - m + 2) 
n — m + i + 1 

Note that di + i > di so that 1st stream dominates the outage 
with vanishingly smaller contribution from higher-order ones 
so that the following corollary holds. 

Corollary 2. The error rates with the APA behave in the same 
way as in the unoptimized system at the low outage regime, 



Pi* » P 2 * > ... > P* 



(14) 



To quantify the performance improvement of the APA, we 
use the SNR gain G of optimum power allocation, which is 
defined as the difference in the SNR required to achieve the 
same error rate in the unoptimized and optimized systems [ 12 1, 



Pout (at, a* m ) = P out (G, G) 



(15) 



Lemma 1. The SNR gain G of either average or instantaneous 
power allocation is bounded, for any fading distribution, as 
follows: 



1< G < m 



(16) 



In the i.i.d. Rayleigh fading channel, the upper bound is 
achieved at the low outage regime, when R < ln(l + 70), 
via the APA in Theorem Q] or via an instantaneous power 
allocation that minimizes Pout- 
Proof: The lower bound follows from the fact that opti- 
mized system cannot perform worse than the unoptimized one. 
The upper bound is a gain of a hypothetical system in which 
every stream enjoys m-fold boost in power, on — m. The 
power-constrained optimized system cannot perform better 
than that, since in this case the total available power equals 
m, so that on < m, and P ou t is decreasing in each a;. The 

3 This is similar to the case of uncoded V-BLAST in |31 |12I . 



achievability part of the APA follow from the comparison of 
(fT2l . (fl4l to (©, ©. Since instantaneous optimization cannot 
perform worse than average, this also proves the achievability 
part of the instantaneous optimization. ■ 
The power allocation in (0 approaches the maximum SNR 
gain of m in the low outage regime. Note that the bounds in 
(TToT l are the same as for the unoptimized V-BLAST in lfT2l : in 
either coded or uncoded V-BLAST system, power allocation 
cannot bring in a SNR gain greater than m. It thus follows 
that there is no additional diversity gain associated with the 
APA in both cases. The diversity gain can be found from 1 17| 



lim In P otlt / 11170. 



To- 



(17) 



or, when a closed-form expression for P ou t is available (e.g. 
(fT2b). by inspection. The following corollary follows immedi- 
ately from Lemma [T] 

Corollary 3. The average APA does not provide any ad- 
ditional diversity gain over the unoptimized system for any 
fading distribution. In the i.i.d. Rayleigh-fading channel, 



d A 



PA 



1 



where d u = n — m+1 is the diversity gain of the unoptimized 
system and the target rate R is fixed. □ 

Example. To get some insight, consider the coded 2 x 2 V- 
BLAST. The unoptimized outage probabilities are given by 



Pi 



- 1 



> P 2 



To 



1 



To 



(18) 



The stream diversity gains are d\ 
power allocation is given by 

ait fa 2 - at > aX ? 



1 < c?2 = 2. The optimum 



4.3 



To 



(19) 



The resulting optimized outage probabilities are given by 



p* 

out 



Pi 



270 



» p 2 * 



1 



2-4f 



To 



(20) 



and the diversity gains are d\ = 1 < d\ = 4/3. The 
optimization reduces the outage probability of the dominating 
1st stream by a factor of 2 so that P* ut ps P «{/2. However, 
the lower power allocated to the 2nd transmitter results in 
decreased diversity order at the 2nd stream, compared to the 
unoptimized system, d\ < — 2, yet P* 3> P 2 so that P* 
is still dominant in the low outage mode. The system outage 
probability for various fixed rates is plotted in Fig. Q] The 
approximation © of optimum a* exhibits good accuracy over 
the whole SNR range. The SNR gain of APA increases with 
the SNR and approaches 3 dB (as predicted by (TToTl). which 
is the same as the SNR gain of the optimal ordering with the 
uniform power/rate allocation [4|[8 |. 

4 Note that the diversity gain is the same as for the uncoded system 
[6][T2|. For coded system (with capacity-achieving codes), this result has 
been obtained (in a different way) in I17II16I . 
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Fig. I. The system outage probability of 2 X 2 V-BLAST in i.i.d. Rayleigh 
fading channel with and without power allocation for various fixed rates. The 
closed-form a* in |9j provides good accuracy over the whole SNR range. 



IV. Average Rate Allocation 

In this section, we study the optimum average rate allocation 
(ARA) assuming the uniform power allocation, on = 1. The 
optimization problem is formulated as follows: 



min Pq. 
subject to 



(21) 



; Ri = mR, Ri > 0, 



so that the total system target rate is mR and individual rates 
are adjusted to minimize the system outage probability. The 
following Lemmas provide some insight into the problem and 
are also instrumental in finding the optimal rate allocation. 

Lemma 2. In the low outage regime R < ln(l + 70), the 
optimization problem in (12 1 b is convex and thus has a unique 
solution. 

Proof: see Appendix iBl ■ 

Lemma 3. In the low outage regime, the optimum rates in the 
optimization problem above satisfy 



R\< #2 • • • < ^C-l < Kn 



(22) 



Proof: see Appendix 151 



Lemma 4. In low outage/moderate rate regime 1 < R < 
ln(l + 70), the optimum per-stream outage probabilities at 
streams i and j are related as (n — m + i)P* — (n~m+j)P* 
provided that both streams are active, so that P* > P* for 
i < j, i.e. lower-order streams contribute more to the system 
outage probability. 

Proof: see Appendix IBl ■ 
Lemma |4] implies that the diversity orders at each stream 
are equal in the ARA, unlike the APA. Note the waterfilling 
analogy here: since the optimum is achieved when stream error 
rates are roughly equal, the rate allocation algorithm "pours" 
more rate to the streams with smaller initial error rate (higher 
diversity order streams) in order to make their error rate larger 



and less rate to lower order streams in order to make their error 
rate smaller. If some lower order streams are too weak, no rate 
is allocated to them. 

We are now in a position to characterize the solution to the 
optimization problem in d2U . 

Theorem 2. In low outage/moderate rate regime 1 < R < 
ln(l + 70), an approximate solution to the optimization prob- 
lem in ( 1271 ) is given by: 



Ri 



In 70 



mR — mA In 7o 



Ci 



where 



b(n — m + i) 
max[0, ■], and a, b and Ci are given by 



(23) 



In (n — mA + i — 1) 



i=i 

In (n — m 



n — mA + i 
i - I V. - 



n — m,A 



n — m-\- 1 

and mA is the number of active transmitters, determined from 



m A = dxgmmP out (R*\ mA =k) 

k—l:m 



(24) 



Proof: see Appendix FBI ■ 
Note. Another way to find tjia is based on Lemma [3} 

m A « argmax{fc : R* m _ k+l \ mA =k > 0} (25) 



fe=l:- 



While this is less accurate than ( 124-b in some cases, it provides 
an additional insight unavailable from the latter. 



Using ©, and P* 



obtains the rate-optimized outage probability 



P^ (from Lemma |4j, one 



P* 

out 



E 



-PL = nbPl 



(26) 



b(n — in a + rn 



1)! 



To 



(n-1)! 

where b is as in d23l . 

It follows from Theorem 2 that the larger the target rate 
mR, the more transmitters stay active with the optimal rate 
allocation. If we begin to increase the SNR while keeping 
the total rate fixed, eventually only one stream will remain 
active at high SNR. While this approach, analyzed by Prasad 
and Varanasi in Q, achieves full receive diversity, it does 
not exploit the entire available capacity. On the other hand, 
if we set mR = C w mln(l + 70) to maximize the rate, 
where C is the ergodic system capacity, the outage probability 
becomes large, P out ~ \. To balance the two extremes, we 
follow ifTTl and introduce the multiplexing gain < r < 
m, where r = corresponds to a fixed rate at high SNR, 
and set R = — ln(l + 70) ~ ^11170, which represents an 
adaptive transmission system where the data rate is directly 
linked to the available average SNR. Smaller r decreases the 
outage probability, while larger r permits higher transmission 
rates. Thus by adjusting the multiplexing gain, we control the 
rate-outage probability tradeoff. For this setting, the following 
corollary follows from Lemma [4] and ( f26b . 
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Corollary 4. In the low outage/moderate rate regime 1 < 
R = — In 70 < ln(l + 70), the diversity gains at each stream 
are equal to the system diversity gain dARA- 

>'> Mi.x = 1 — > ; ) fi - —) (27) 



roj-1 



which can be bounded as 

(n-niA + l) (l — ) < d ARA < n ( 1 

where the upper bound is achieved when m A — 1. 

When optimized over m^i, the DMT in (|27| i is the same as 
that in lfl6l for the fixed ordering with rate allocation. Note 
however that our result holds at finite SNR while that in lfl6l 
was derived for SNR — » 00 (see [21] for a detailed discussion 
of limitations of the SNR-asymptotic DMT). 

Example. Consider the 2x2 system, n = m = 2. With 
R = I ln(l +70) ~ I ln7o> both transmitters are active when 
the rate is high: 



1 

r> 2 



31n3 



(28) 



4I1170 

where the inequality is from (124-b and the equality is from 
(|25| |. Clearly, the latter agrees with the former at high SNR 
and the critical multiplexing gain, above which both streams 
are active, is 1/2. Another way to use this condition is 

which gives an SNR threshold above which both streams are 
active for given r. Under these conditions, the optimal rates 
are given by 

1 

— L 

3 

where AR = In 70 — R represents the per-stream rate ad- 
justment that minimizes the system outage probability. The 
optimized outage probabilities are 



R — — AR, i?2 



R 



Iar, 



(30) 



P* 



„, 3 V 1 2) p * ^ 1 p * p* ^ 3 „ 

7o i r 2 ~ o 1 ' out ~ o 1 



(31) 



(1 - r/2) > d J 4PA(^) = 1 - r/2. This V. JOINT AVERAGE POWER/RATE ALLOCATION (APRA) 

In this section, we consider the joint average power and rate 
allocation: 



? a 2 2 
and the diversity gains at both streams are equal: d A R A (r) 
dt(r) = d* 2 (r) = 

differs drastically from the APA in d20l ), where the 1st stream 
diversity gain does not improve compared to the unoptimized 
system and the 2nd stream error rate is asymptotically negli- 
gible. For the ARA, 2nd stream outage probability provides a 
sizable contribution to the system one and the system diversity 
gain is improved by the optimization. 

The first transmitter is inactive when the opposite inequality 
holds in ( f28l ). In this "low-rate" regime, R% = 2R, and the 
optimized outage probability is given by 



The optimum outage probability and rate allocation for this 
example are plotted in Fig. [2] Note that the approximations in 
Theorem 2 agree well with the accurate numerical solutions 
over the whole SNR range. 



° 10 




(a) 



□ num. ARA 
approx. ARA 




(b) 



Fig. 2. Outage probability (a) and optimum rates (b) for 2x2 V-BLAST 
with the ARA in the i.i.d. Rayleigh fading channel, closed-form R* is from 
{23}, r = 2/3. Note that the approximation agrees well with the precise 
numerical solution, and that the ARA offers a significant reduction in the 
outage probability, even at low to moderate SNR. 



min P ou t, subject to R4 = mR, Ri > 0, 



i 

E 



(33) 



a, 



m, 014 > 0, 



r out — r 2 ~ 2 "0 



(32) 



so that the diversity gain dARA{ r ) = 2(1 — r) > dAp A {r) = 
1 — r/2, i.e. a significant improvement over the APA. For 
fixed R, d A R A = 2 > dAp A = 1, i.e. the ARA enjoys the full 
diversity as opposed to the APA. 



where the total rate is mR and the total power is to, both of 
them are functions of the average SNR. The following property 
is instrumental. 

Lemma 5. In the low-outage moderate-rate regime, i.e. when 
In 2 < Ri < ln(l +70) for all active streams, the optimization 
problem above is convex. 

Proof: see Appendix ICl ■ 
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The solution can now be characterized in 
surprising way. 



somewhat 



Theorem 3. In the low-outage moderate- rate regime 1 < 
Ri < ln(l + 70), the uniform power allocation among active 
transmitters attains the minimum outage probability in ( 1331) , 



a* = 0, i= 1,. 
m 



m A 



, m — rriA 
i — m — rriA + 1, • • 



. , m 



(34) 



with 70 
in the SNR 



The optimal rates and the number of active transmitters are 
given by (1231 , d241 >, respectively, with the substitution 70 — > 
7n- 

mA 

Proof: see Appendix ICl ■ 
The APRA outage probability is given by 
replaced by ^-70, so that the APRA gains ^ 
compared to the ARA. 

According to Theorem [3] a non-uniform power allocation 
on top of the rate allocation does not bring in any additional 
advantage in terms of the outage probability. The difference 
between the ARA and APRA lies in the extra power which is 
added to active transmitters in the APRA every time one of 
the transmitters is turned off while the per-transmitter power 
in the ARA is kept fixed. A consequence of this is that weak 
streams are kept silent by the APRA for a wider range of the 
SNR compared to the ARA (i.e. threshold multiplexing gain 
is higher for the former). 

Example. Consider again the 2x2 system. As long as both 
transmitters are active, there is no difference with the ARA, 
and the optimal rates are given by (13 Oi l, and the outage prob- 
abilities are as in OTb . However, the threshold multiplexing 
gain or SNR are now slightly higher: 

' ' ' ~ (35) 



r > - + 



3 In 12 , 3 In 12 

or M70 > 



4 In 70 2(2r-l) 

This is a consequence of an extra redistributed power when 
one stream is turned off. 

When only one transmitter is active, i.e. with the opposite 
inequality in OSb . the outage probability becomes 



P* 

out 



1 -2(1- 
0T0 



(36) 



exhibits 4-fold improvement compared to the ARA in 
with one active transmitter (regardless of the multiplexing 
gain), due to the doubled power on the remaining transmitter 
in the APRA. Note also that the APRA brings in additional 
diversity gain compared to the ARA in the range 



31n3 



2 4 In 70 
because of the higher threshold 



<r<- 



1 3 In 12 



4hi7o 



The optimum outage probability and powers/rates are shown 
in Fig. [3] and |4] for various optimization strategies for r = 1. 
While the ARA and the APRA are identical for about 70 > 
15 dB, when both streams are active in either system, the 
APRA exhibits better performance at lower SNR 70 < 15 dB, 
when there is only one active stream in the latter. Note also the 
difference in optimal power allocation between the APA and 



APRA strategies: while a larger portion of power goes to the 
1st transmitter in the former, the power is distributed uniformly 
among active transmitters in the latter. The reason for this 
difference is that the stream diversity orders are equal in the 
rate-optimized system so there is no need to favor the first 
stream in order to reduce its dominating error rate. Overall, the 
APRA exhibits the best performance, the ARA approaches it 
at high SNR, followed by the APA, which still provides about 
3 dB performance improvement over the unoptimized system 
at high SNR, which is the same as the SNR gain due to the 
optimal ordering in the unoptimized system PTllfSl . 




1Cf 



1Cf 





- unoptimized 





num. APA 




— approx. APA 


□ 


num. ARA 




- approx. ARA 





num. APRA 




— approx. APRA 



20 
SNR (dB) 

Fig. 3. The optimized outage probability of 2 X 2 V-BLAST in i.i.d. Rayleigh 
fading, r = 1 , for various optimization strategies; the optimum power/rate 
allocations are as in Fig. [4] 



VI. Robustness 

Due to uncertainties or variations in system parameters and 
in the objective and constraint functions, a robust algorithm, 
which is insensitive to these variations, is desired from the 
practical perspective. In this section we discuss the robustness 
of the APA, ARA and APRA optimization strategies. 

To quantify the robustness of the optimization algorithms 
above, we employ the following measure of local sensitivity 
to a system parameter u, which may be power or rate: 



AP* Ip* 

out I ou 



Au/u 



ur out 



du 



P* 

out 



(37) 



where P* ut denotes the outage probability of the optimized 
system so that 5 is the ratio of the normalized variation in 
performance to the normalized variation in a system parameter. 
If 5 is small to moderate number, it implies that a small 
change in u leads to a relatively small change in P* ut - We 
then say that the algorithm is robust to variations in u. We 
are interested in how a small variation in u = Ri* or u — a* 
affects the outage probability. From the optimality condition, 
dP* ut /du* = v, where v is the Lagrange multiplier in the 
corresponding convex optimization problem, so that 



s = uu*/p; u 



(38) 



s 



a 1 




(a) 



1.5 



0.5 



IX) 
10 



num. APRA 
approx. APRA 



1 il Tx 



20 
SNR (dB) 

(b) 



30 



40 



□ 


num. ARA 




- approx. ARA 





num. APRA 




approx. APRA 




(c) 



Fig. 4. The optimum powers for the APA (a) and for the APRA (b), and the 
rates (c), for the same sett ing as in Fig. [5] The approximate a* are fro m (9 ) 
for the APA and from )34t for the APRA; the approximate R* are from <23l 
Note that the approximations agree well with the precise numerical solutions 
over the whole SNR range and that the ARA and APRA indeed have the same 
performance in the low outage regime. Also note the opposite tendencies in 
power allocations of the APA and APRA in (a) and (b): while the former 
allocates vastly different powers, the latter is almost uniform at high SNR. 
The SNR threshold in {35} is 70 > 16 dB and a grees well with Fig. 3(b). 



i.e. the system robustness measure is proportional to the 
Lagrange multiplier [22). 

Analysis of the APA robustness, u = a*, is done similarly 
to the uncoded systems in lfl2l . and d38l > reduces to 



Si 



1, 



(39) 



- 1 



To 



i > 2, 



where Si denotes sensitivity to on, and bi are given by ( fTOb - 
We see that the APA is robust as long as n — m is not too 
large; moreover, since 8\ S2 ^> • . • S m , higher streams 
exhibit better robustness. 

For the ARA, we are interested in how a small variation in 
Ri* affects the outage probability, u = R* . Substituting 
and d26j > into fl38l l, one obtains: 



b(n — rtiA + m — 1) 



nm 

Ri < R, 

m A 



(40) 



where mR is the total target rate, and the inequality is based 
on the observation that n — 1 < n + m — tua — 1 and 

b = YT=i n-m A +i ^ 1J ir- When R is proportional to In 70, 
which is required to obtain a non-zero multiplexing gain, and 
mn/mA is not too large, S^ is a moderate number in the 
practical SNR range. Contrary to the APA case, lower stream 
demonstrate better robustness in the ARA , since R* < R* +1 
(Lemma |3). In the case of n — m — m A , d40b simplifies to 

S Ri w R*/b < RI (41) 

where b = Y"" 1 , i > 1. 

For the APRA, it is easy to verify that the rate sensitivity 
is given by (l40b . while the power sensitivity is given by 

(n — 1)\ n 
rr < 

m A 



b(i 



m A 



1) ! 



< , (42) 



which says that the APRA is robust as long as is not too 
large. In contrast the APA, each stream here has exactly the 
same sensitivity to variations in its power. 

Example. For n = m — 2, the APA sensitivities to first and 
second stream powers are given by 



Sx w 1 » S 2 « {2{e R - l)/ 7 o) 



1/3 



(43) 



For the APRA with when both transmitters active, the sensi- 
tivities to a*, i = 1,2, are the same and equal to 2/3, and 
the sensitivities to R\, i?£ are given by 



S Rl « \ (R - ^AR^j , S R2 



(44) 



where AR — In 70 — R. When the first transmitter is turned 
off, the power sensitivity is 1 and the rate sensitivity is 62 ~ -R- 

VII. Dual Problems 

We have considered above the optimal power and/or rate 
allocations to minimize the outage probability. Due to the 
convex nature of these problems (which also have zero duality 
gap), these allocations can also be used to minimize the total 
power or maximize the total rate under the outage probability 
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constraint - another practically-important possibility. This is 
formalized below. 

Theorem 4. Consider the following problem dual to ([H): 

m 

min> oii subject to P out (a.) < e (45) 

i=0 

Its solution a is the same as that of the problem in (0 and, 
in particular, is as in (|9]l under the conditions of Theorem Q] 
for e = P ou t{oL*\ 

Proof: see Appendix ID1 ■ 
Similar duality property hold for the problem in (f2Tb . which 
is formalized below. 

Theorem 5. Consider the following problem dual to ( T2T1 ): 

m 

max^ i?;, subject to P out (R) < e (46) 

R i=0 

Its solution R° is the same as that of the problem in (I2U so 
that Theorem\2\applies with e = P ou t(R*). 

Proof: along the same lines as that of Theorem [4] ■ 

VIII. Conclusion 

A comparative analysis of the optimum power, rate and 
joint power and rate allocations for the coded V-BLAST 
has been presented. All considered optimization strategies 
use the average channel statistics rather than instantaneous 
channel matrix. This has the advantage of lower complexity 
due to less frequent feedback sessions and lighter computation 
load yet offers a significant improvement in performance. 
Compact, closed-form expressions for the optimum allocations 
of average power and rate have been given. The average rate 
allocation is the most rewarding strategy in terms of incremen- 
tal improvement, since it improves the system diversity gain. 
In contrast, power allocation can at most give an m-fold SNR 
gain and it does not provide any additional diversity (provided 
that all streams stay active, as required for high spectral 
efficiency at high SNR). In the low outage regime, the uniform 
power allocation is already optimal for the coded V-BLAST 
with rate allocation, and using non-uniform power allocation 
does not improve the outage probability. This is very different 
from the power allocation only, where most of the power goes 
to the 1st stream with progressively smaller fractions to higher- 
order ones. All these optimization strategies are shown to be 
robust that makes them good candidates for practical systems. 
Dual problems of minimizing the total power or maximizing 
the total rate under the outage probability constraint are shown 
to have the same solutions as the primal ones. 
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Appendix 

A. Proof of Theorem Q] 

It is straightforward to verify that this is a strictly convex 
problem and thus has a unique solution; the KKT condi- 
tions are necessary and sufficient for optimality. Taking the 
derivative with respect to a; of the Lagrangian L(a) = 
Pout{a) + A (Y^T oti — in), where the approximation in (|6]l of 
P out is used, and equating it to zero according to the method 
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of Lagrange multipliers, we obtain 



PL (a*) 
da* 



, n—m-\-i 



■kin— m+z+l in— m-\-i 



^- m+l {n~m + i- 1)! 



+ A = 0. 
(47) 



Observe that all of the a*i can be expressed via the single 
parameter A. Expressing a* via A and substituting the result 
into the total power constraint ai ~ TO ' one obtains a 
polynomial equation for A, which can be approximately solved 
using the Newton-Raphson method. The solution follows along 
the same lines as that for the uncoded system in Appendix A 
of lfl2ll where further details of this method can be found. □ 
The following Lemma shows that an optimization preserves 
the accuracy of an approximation of the objective function, 
which justifies using (O instead of the true outage probability. 

Lemma 6. Let fix) and fix) be true and approximate 
objective functions, within ±e% of each other, 

/(») 

/O) 

Then, their optimized values are also within ±e% of each 
other, 



miru fix) 
1- e < < 1 + e 



(48) 



min x f(x) 

i.e. the original accuracy is preserved by optimization l1 

Proof: Let x* and x* be the true and approximate optimal 
points, 

x* = argmin/(x), x* = argmin/(x) 

X X 

Note that x* ^ x* in general. Observe however that 
f(x*)/f(x*) < 1 + e. Indeed, assuming that the contrary is 
true, one obtains 

fix*) , fix*) 

— — - > 1 + e > — — - 

fix*) ~ fix*) 

from which is follows that fix*) < fix*), which is impossi- 
ble. Similar argument proves f(x*)/f(x*) > 1 — e and thus 



B. Proof of Lemmas [2] \3\ and Theorem \2\ 
The Lagrangian for this problem is 

m / m \ tci 

L = J2 P i - V (J2 R i~ mR ) -J2 X * R * 

i \ i / i 

Approximating P out according to ©, one obtains the KKT 
conditions as follows: 

P* > 0, R i = mR ' Xl ^ °' XiR *i = 

i 

P-iR*) -Xi-u = 0, i = l...m (49) 

5 The authors have originally proved a weaker version of this Lemma. It 
was then noted by E. Telatar that the present version may be true. 



where P!(Ri) denotes the derivative of Pi with respect to Ri, 
Pl(Ri) « — ; , . - ■ (50) 



7 ™" m+l (n-m + i- 1)!' 



Proof of Lemma |2] (convexity): Notice that the second 
derivative of Pi with respect to Ri is positive for Ri > 0, 



e^(e^-l) 



n— m+2— 2 



[(n-m + i)e Ri - l] 



> 0. 



Pi(Ri) = 

1 V ; ^- m+ \n-m + i~ 1)! 

It follows that P out is convex as a sum of convex functions, 
and since the constrains are also convex, the problem in (BTI ) 
is convex. Therefore the optimum rate allocation is unique in 
the low outage regime. ■ 
Note that in the KKT conditions d49l ), Xi is a slack variable, 
which allows us to rewrite d49l as: 



PUR*) - v > 
[P{(Rt)-v]Rt = 



(51) 



Proof of Lemma\3\( optimal rates are ordered): Provided 
that all R* are strictly positive, the optimum rate R* is the 
point at which the slope P- is the same for all i, P((R*) = v, 
as seen from from the KKT conditions (f5Tb . Let us fix 70 
and Ri = R and consider P- as a function of i. Observe that 
P[ +1 (R) - Pl(R) 7o( f_-^ +i) < P[(R) as long as e R ~l < 7o , 
which is satisfied in the low outage regime. In other words, 
P! is monotonically decreasing in i. On the other hand, recall 
that P{(R) as function of R is monotonically increasing. We 
conclude that to achieve the equality = P{(Ri), 

it is necessary that Ri+\ > Ri for positive Ri. 

Suppose now that for some i, R* > but R* +1 = 0. Ac- 
cording to the KKT conditions ( Bit , in this case > 
v = P/(iZ*). But from the monotonicity properties of P[ 
discussed above, P{ +1 (R$ +1 = 0) < P/(0) < P{(R* > 0) - a 
contradiction. We conclude that R\ < R2 . . . < R m -i < Rm 
and it holds with equality between Ri and if and only 

if Ri = R i+X =0. ■ 

Proof of Lemma |4] ( diversity order): it is easy to verify 

from ©, m that P[ = ^'X-f' P * ~ (n - m + »)-?<■ 
But at the optimum in OTb . the slope P[ is the same for all 
active transmitters, v = P!* rj (n — m + i)P*, R* > 0, so 
that the optimal stream error rates P* are scaled versions of 
one another. ■ 
Proof of Theorem\2\(closed form rates): From the KKT 
conditions in (|5B we see that if R* > 0, then P/(P*) = v, 
or 



l) n m+l 1 = u^- m+i (n -m + i- 1)!, 



Let us employ the moderate to high rate approximation in the 
LHS, e R * (e R < - i)™-™^ 1 ~ e (n-m+i)ii« ( which becomes 
equality for n — m + i = 1), and take the logarithm of both 
sides: 

R* = 1 —— [In M~ m+< (n-m + i- 1)!}] (52) 

n — m + i L L ' J + 

where the operator [-] + = max[0, •] accounts for the case R* = 

0. While this approximation becomes less accurate for Ri < 

1, at least one of the R* satisfies R* > R, and hence the 
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approximation is accurate for at least one of the Ri if the 
total rate is not too small, mR > 1, The approximation error 
in smaller R* does not have as much impact on the total outage 
probability so that the over accuracy is good. The Lagrange 
multiplier v is found by substituting R* from ( 1521 into the 
total rate constraint Ri — mR: 

g In v + In [7j~ w+i (n - m + i - 1)!] 



mi? 



i=m— m^ + l 



f = 7o 



n — m + i 



(53) 



C. Proof of Lemma \5\ and Theorem fi] 

Proof of Lemma [5] (convexity): Consider the following 
function: f(R,a) — (e — l)/a. Its Hessian 



A/ 



Va 2 



-e R /a 2 2(e R ~l)/a\ 

is positive definite at the moderate to high rate regime, 
R > In 2, since the diagonal entries are positive and the 
determinant is also positive, det A/ = e R (e R — 2)/a 4 > 0. 
Hence, / (R, a) is jointly convex in (R, a), Pi is convex by 
the composition rule ll22l . and P out is convex as a sum of 
convex functions. Since the constraints are also convex, the 
APRA problem is convex. ■ 
Proof of Theorem fi] (closed form solution): The La- 
grangian for this problem is 



E 



v a \ 7 a l -m 



i \ i / \ i / 

m m 

i i 

and the KKT conditions are given by 

R* > 0, R i = mR ' Xr * - °' XR * R i = 

i 

<%i > 0, a* — m, A Q . > 0, A Qi a* = 

i 

P!(R*) - X Rt -is R = 0, -Wp. - \ at - Va =0, (54) 

a i 

Here, P? denotes derivative of Pj with respect to Ri, and we 
have exploited the fact that 

dPi _ e fl * - 1 1 dPi ^ 1 dPi 
don e Ri oii dRi ai dRi 

when 1 < Ri < In 70, as easy to verify from (0. As before, 
\ Ri and X ai are slack variables and can be eliminated from 
454): 



pm) -pr>o 
-zm- Va > 



(55) 



Expanding X^R* = 0, A Qi a* = while using d54i i. one 
obtains: 



\P((Rt)-v R )R$ = 
.(-^P-^)< = ° 



(56) 



If i-th transmitter is active, i?* > 0, vr — P^(R*) and a* = 
—P!(Rt)/v* = —vr/voi does not depend on i. We conclude 
that the uniform power allocation among the active streams is 
optimal. According to the total power constraint, the amount 
of power allocated to each of the active streams is equal to 
m/rriA, where wia is the number of active streams. Hence 
the optimal rates are given by the same expressions as in the 
APvA in d2TI ). with 70 changed to ^j7o- ■ 

D. Proof of Theorem [4] 

A formal proof can be constructed using Lagrange duality 
theory l22l (using the fact that the duality gap is zero in 
this case). A more simple and straightforward proof is by 
contradiction. Assume that ct* solves ([8]) (the solution is 
unique since the problem is strictly convex) and that a° solves 
(|45| | with e = P ou t(a*). Since P ou t is strictly decreasing in 
ai, the solution in d45l is on the boundary P ut = e so that 
YZ Q o%<YZ <=m. Define 



T m a 



> 1 



(57) 



so that 



PoutiPa ) < e = P „ t (a*) (58) 

If strict inequality holds in (l58l , then there is a contradiction 
since a* is the minimizer of P ou t- If equality holds in d58l and 
f3a° ^ a.*, then there is a contradiction since a* is the unique 
minimizer. The only remaining possibility is f3a° = ex* so that 
a° = a*. 
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